Abstract. — The complete amino acid sequence of a human γG1 immunoglobulin
(Eu) has been determined and the arrangement of all of the disulfide bonds
has been established. Comparison of the sequence with that of another myeloma 
protein (He) suggests that the variable regions of heavy and light chains are 
homologous and similar in length. The constant portion of the heavy chain con- 
tains three homology regions each of which is similar in size and homologous to 
the constant region of the light chain. Each variable region and each constant 
homology region contains one intrachain disulfide bond. The half-cystines 
participating in the interchain bonds are all clustered within a stretch of ten 
residues at the middle of the heavy chains. 

These data support the hypothesis that immunoglobulins evolved by gene 
duplication after early divergence of V genes, which specified antigen-binding 
functions, and C genes, which specified other functions of antibody molecules. 
Each polypeptide chain may therefore be specified by two genes, V and C, which 
are fused to form a single gene (translocation hypothesis). The internal homol- 
ogies and symmetry of the molecule suggest that homology regions may have 
similar three-dimensional structures each consisting of a compact domain which 
contributes to at least one active site (domain hypothesis). Both hypotheses 
are in accord with the linear regional differentiation of function in antibody 
molecules. 



Antibodies or immunoglobulins can interact with a wide range of different 
antigenic determinants and, after specific binding to an antigen, they play a 
fundamental part in physiological functions of the immune response. The 
specificity of antigen binding depends ultimately upon amino acid sequences of 
the variable or V regions of antibody molecules. It is the diversity of these 
sequences which results in the range of specificities required for a selective im- 
mune response. In contrast, other regions of the antibody molecule have rela- 
tively constant sequences and are responsible for physiological functions. Like 
enzymes, these C regions appear to have evolved for a restricted set of interac- 
tions. This unusual picture of intramolecular differentiation has emerged from 
studies of the structure of immunoglobulins from different animal species. 1 To 
date, only portions of immunoglobulin molecules have been subjected to amino 
acid sequence determination. 

We now report the amino acid sequence of an entire human γG1 immunoglobulin
(molecular weight 150,000), the location of all disulfide bonds, the
arrangement of light and heavy chains, and the length of the heavy chain V
region.

Materials and Methods. — The isolation of the myeloma protein Eu 2 and the preparation
of its CNBr fragments 3, 4 have been described. Similar methods were used for the
isolation of the γG1 myeloma protein He and for the preparation of its CNBr fragments.

We have previously described the methods used for enzymatic digestion with trypsin,
chymotrypsin, and pepsin, gel filtration, ion exchange chromatography, high voltage paper
electrophoresis, determination of NH2-terminal and COOH-terminal residues, amino acid
analysis, and determination of amino acid sequences by the dansyl-Edman procedure. 4-6

The positions of glutamine and asparagine were assigned 9 by determining the electro- 
phoretic mobility of peptides and by amino acid analysis of the peptides after enzymatic 
hydrolysis. The half-cystinyl residues contributing to each intrachain disulfide bond 
were determined 10 using the diagonal electrophoresis method. 11 

Results. — The organization of the whole molecule is shown in Figure 1; an un- 
equivocal proof of the arrangement of the two identical light chains and two 
identical heavy chains has already been given. 4 Each light chain is linked to its 
neighboring heavy chain by a disulfide bond between corresponding half-cystines V.
Half-cystines VI and VII form bonds linking the half-molecules via
the heavy chains. Trypsin cleaves the molecule at lysyl residue 222 to form two
Fab(t) and one Fc(t) fragments. 2, 6

Fig. 1. — Over-all arrangement of chains and disulfide bonds of γG1 immunoglobulin
Eu. Half-cystinyl residues are numbered I-XI; numbers I-V designate corresponding
residues in light and heavy chains. PCA: pyrrolidonecarboxylic acid. CHO: carbohydrate.
"Fab(t)" and "Fc(t)" refer to fragments produced by trypsin, which cleaves
the heavy chain as indicated by dashed lines above half-cystinyl residues VI. VH, VL:
variable regions of heavy and light chains. CL: constant region of light chain. CH1,
CH2, CH3: homology regions comprising CH or constant region of heavy chain.

There are several strikingly linear arrangements in the primary structure. 
From their amino termini to half-cystines V, the light and heavy chains can be 
aligned or put in register. The intrachain disulfide bonds are linearly and
periodically disposed. 12, 13 In accord with the alignment of light and heavy chains,
corresponding intrachain disulfide bonds are in similar positions and the disulfide 
loops are of approximately the same size. 

Previous studies 7 have suggested that V regions of light and heavy chains have
similar lengths and begin at the NH2-termini; this will be confirmed below.
The CL region of the light chain has the same length as VL, but the CH region of
the heavy chain is about three times as long. CH may be divided into three
homologous regions of approximately equal length: CH1, CH2, and CH3 (Fig. 1).

We have already reported the amino acid sequence of the first 87 7 and the last 
224 residues 8 of the heavy chain as well as the partial sequence of the entire light 
chain. 6 The complete amino acid sequence of the light chain (214 residues) is 
shown in Figure 2. Positions of the half-cystinyl residues may be compared 
with Figure 1 and the methionyl residues may be correlated with previous studies 
on the CNBr fragments of Eu. 3, 4 The variable region extends through residue
108. In accord with other studies, 1 valine 191 is related to the Inv specificity. 2 

The complete sequence of the heavy chain (446 residues) is presented in Figure
3 which may be compared with Figure 2 for alignment with the light chain
sequence. Isolation of a single glycopeptide 8 indicated that the polysaccharide
portion of the molecule is attached at Asx residue 297. 14 In a previous study 8 we
have suggested that glutamyl residue 356 and methionyl residue 358 may be
associated with Gm 1 specificities. The sequence of Eu (Gm 4+) between
residues 211-252 can be compared with the partial sequence of immunoglobulin
Daw 17 (Gm 4-). The presence of arginine in position 214 of Eu and lysine in a
comparable position of Daw may be associated with their Gm 4 specificities. 18
Of particular significance is the determination of the point at which VH ends
and CH begins. A CNBr fragment comparable to fragment H4 was isolated
from myeloma protein He which has the same Gm specificity as protein Eu. The
sequence of the amino terminal portion of the CNBr fragment from He differed
from that of the H4 fragment from Eu. 21

Fig. 2. — Complete amino acid sequence of the Eu light chain. Half-cystinyl residues are in
boxes and methionyl residues are underlined.

Fig. 3. — Complete amino acid sequence of the Eu heavy chain. Half-cystinyl residues are in
boxes and methionyl residues are underlined.

A comparison of the sequence of the two fragments from residues 101 to 121
is given in Figure 4. The sequences become identical at residue 115 (Eu
numbering). Further studies 21 confirmed that from residue 115 to residue 252 the
sequence of the He fragment was identical to that of Eu H4. In addition,
tryptic fingerprints of the Fc fragments from Eu and He were identical. These
data suggest that the transition between VH and CH is located in the vicinity of
residue 114 (Eu numbering). Studies on a number of additional proteins and a
search for VH region subgroups 7 will be required to locate this point definitively.

Fig. 4. — Comparison of the amino acid sequence of the Eu heavy chain from residue 101-121
with the corresponding sequence of the heavy chain of myeloma protein He.
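The comparison described above, in which two aligned heavy-chain fragments differ up to a certain point and are identical from there on, can be sketched in a few lines of Python. This is only an illustrative sketch: the function name identity_onset and both strings are hypothetical placeholders (arbitrary letters, not the Eu or He sequences), arranged so that the window starts at residue 101 and the last mismatch falls at residue 114.

```python
from typing import Optional


def identity_onset(a: str, b: str, first_residue: int = 1) -> Optional[int]:
    """Return the residue number from which a and b agree through to the end.

    Returns None when the final aligned position already differs.
    """
    if len(a) != len(b):
        raise ValueError("aligned windows must have equal length")
    onset = None
    # Walk backwards from the end until the first mismatch is met.
    for offset in range(len(a) - 1, -1, -1):
        if a[offset] != b[offset]:
            break
        onset = first_residue + offset
    return onset


# Placeholder windows covering residues 101-121 (21 positions each).
# These letters are NOT real sequence data; they only mimic the pattern.
window_a = "ABCDEFGHIJKLMNOPQRSTU"
window_b = "ABXDEYGHIJZLMWOPQRSTU"

onset = identity_onset(window_a, window_b, first_residue=101)
print(onset)  # 115
```

With real data the two windows would be the aligned residues of the fragments being compared, and a single pair of proteins can only bound the transition point, which is why further proteins are needed to fix it.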

Discussion. — The present studies provide proof of the covalent structure and 
arrangement of chains in γG1 immunoglobulin. The half-molecule of Eu is the
largest protein unit (446 + 214 residues) for which a complete amino acid 
sequence has been determined. In a protein of this size, one cannot neglect the 
possibility of small errors in sequence assignment; for this reason we are carrying 
out a number of further checks using various methods of peptide cleavage and 
fractionation. 
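The residue counts quoted in the text can be cross-checked with simple arithmetic. The sketch below uses only numbers stated above (214 and 446 residues, the light chain variable region ending at residue 108, the heavy chain transition near residue 114, tryptic cleavage at lysyl residue 222). The mean residue mass of 110 daltons is an outside rule of thumb, not a value from this work, and the mass estimate ignores carbohydrate.

```python
# Residue counts taken from the text.
LIGHT_CHAIN = 214    # residues in the light chain
HEAVY_CHAIN = 446    # residues in the heavy chain
VL_END = 108         # last residue of the light chain variable region
VH_END = 114         # approximate last residue of the heavy chain variable region
TRYPSIN_SITE = 222   # lysyl residue cleaved by trypsin

half_molecule = LIGHT_CHAIN + HEAVY_CHAIN         # one light plus one heavy chain
whole_molecule = 2 * half_molecule                # two identical half-molecules
cl_length = LIGHT_CHAIN - VL_END                  # constant region of light chain
ch_length = HEAVY_CHAIN - VH_END                  # constant region of heavy chain
fc_heavy_residues = HEAVY_CHAIN - TRYPSIN_SITE    # heavy chain residues beyond the cut

# Assumption (not from the source): about 110 daltons per residue on average.
MEAN_RESIDUE_MASS = 110
approx_mass = whole_molecule * MEAN_RESIDUE_MASS

print(half_molecule, whole_molecule)        # 660 1320
print(cl_length, ch_length)                 # 106 332
print(round(ch_length / cl_length, 2))      # 3.13
print(fc_heavy_residues)                    # 224
print(approx_mass)                          # 145200
```

Under these assumptions the constant region of the heavy chain comes out at roughly three times the length of that of the light chain, the 224 residues beyond the tryptic site match in number the "last 224 residues" of the heavy chain mentioned earlier, and the rough mass estimate is of the same order as the stated molecular weight of 150,000.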

One of the most striking features of the immunoglobulin molecule that emerges 
from the completed sequence is the sharp demarcation of its polypeptide chains 
into linearly connected regions that are associated with different functions.
Variations in the sequences of paired VH and VL regions for the function of
antigen binding in the selective immune response, and at the same time,
conservation of sequence in CH and CL regions for other immunological functions appear
to require special genetic and evolutionary mechanisms. 22

The amino acid sequence of Eu provides convincing evidence that the
immunoglobulin molecule evolved by successive duplication of precursor genes. 23, 24 Our
analysis of a complete heavy chain has revealed an additional homology region
(CH1), the structure of which was not previously known. Earlier comparisons 7, 8
of polynucleotide sequences corresponding to both chains of Eu showed evidence
of homology between VH and VL and homologies among CL, CH2, and CH3. A
complete comparison of the amino acid sequences of CL, CH1, CH2, and CH3 is
given in Figure 5. In a stretch of 100 residues, any two regions are identical in 
29 to 34 positions. It is noteworthy that the stretch in the heavy chain from 
residue 221 to 233 which contains the interchain disulfide bonds 5 has no homol- 
ogous counterpart in other portions of heavy or light chains. 
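The identity counts quoted above (29 to 34 identical positions in a stretch of 100 residues for any two regions) rest on position-by-position comparison of aligned sequences. A minimal sketch of such a count follows. The function names and the three short strings are hypothetical placeholders, not the sequences of Figure 5, and skipping gap positions (dashes) is one possible convention rather than necessarily the one used here.

```python
from itertools import combinations
from typing import Dict, Tuple


def count_identities(a: str, b: str, gap: str = "-") -> int:
    """Count aligned positions where both sequences carry the same residue."""
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    # A position counts only if the residues match and neither is a gap.
    return sum(1 for x, y in zip(a, b) if x == y and x != gap)


def pairwise_identities(regions: Dict[str, str]) -> Dict[Tuple[str, str], int]:
    """Identity count for every unordered pair of aligned regions."""
    return {
        (p, q): count_identities(regions[p], regions[q])
        for p, q in combinations(regions, 2)
    }


# Hypothetical gapped alignment of three toy regions (NOT real data).
toy_regions = {
    "R1": "ACDEFG-HIK",
    "R2": "ACDQFGWHLK",
    "R3": "TCDEYG-HLK",
}

pairs = pairwise_identities(toy_regions)
for (p, q), n in pairs.items():
    print(p, q, n)
# R1 R2 7
# R1 R3 6
# R2 R3 6
```

Applied to four aligned homology regions, the same function would return six pairwise counts, which is the form in which the 29 to 34 range is stated.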

In the data accumulated so far, little or no homology has been found between 
V and C regions. This prompts the speculation that V genes and C genes 
diverged early in the evolution of antibodies to serve two major groups of func- 
tions: antigen recognition functions (ARF) and effector functions (EF) such as 
interaction with cells and complement. The order of emergence of CL genes or
CH genes from a precursor gene is not apparent from the data. A comparison
with sequences of μ, α, δ, or ε chains may indicate similarities in their VH regions 25
and may show whether any of the homology regions are conserved in constant 
regions of heavy chains of these classes. 26 

Early evolutionary divergence of V and C genes is consistent with the
evidence 27, 28 that each chain is specified by two genes, V and C, and the
hypothesis 7, 22, 29 that V gene episomes are translocated to C genes to form a single VC
gene in lymphoid cell precursors. Dreyer and Bennett 30 have previously
suggested a translocation of C genes and, more recently, this has been abandoned in
favor of a detailed "copy-splice" mechanism. 31 Translocation of genes may be the
basis of the phenomena of clonal expression and allelic exclusion in antibody
production. Irreversible differentiation and commitment of a lymphoid
precursor cell may thus occur at the time of gene translocation.

Fig. 5. — Comparison of the amino acid sequences of CL, CH1, CH2, and CH3. Deletions
indicated by dashes have been introduced to maximize the homology. Identical residues are darkly
shaded; both dark and light shading are used to indicate identities which occur in pairs in the
same positions.

The alignment of disulfide bonds, the arrangement of symmetry axes, and the 
fact that proteolytic enzymes cleave the molecule to produce Fab, Fc and Fc' 
fragments 32 suggest that each homology region may be folded in a compact 
domain 29 stabilized by a single intrachain disulfide bond and linked to neighbor- 
ing regions by less tightly folded stretches of polypeptide chain. Such domains 
would have similar but not identical tertiary structures, and each domain would
contribute to at least one active site mediating a function of that class of
immunoglobulin. This domain hypothesis is consistent with the hypothesis that
the molecule evolved by gene duplication as well as with the translocation
hypothesis. As mentioned above, comparison of the structure and function of
CH regions in different immunoglobulin classes should reveal whether addition or
deletion of homology regions and corresponding domains is a major mechanism 
in the evolution of these classes. 

Support for the domain hypothesis would come from finding that limited 
proteolysis of Fab fragments yields fragments containing halves of the Fd
fragments. Similar treatment of Bence-Jones proteins may produce VL and CL
fragments. 38 Additional evidence may come from location in CH of sites for
complement fixation and skin fixation. Final proof or disproof of this hypothesis 
obviously rests on the results of X-ray crystallographic analysis. It is clear 
that a rotation axis passes through the disulfide bonds linking the heavy chains 
and there may be an axis of pseudosymmetry between the light and heavy chains. 
The locations of the interchain and intrachain disulfide bonds, the extensive 
homologies, and the alignment of the light and heavy chains with each other 
suggest that the overall relationships described above will be conserved in the 
three-dimensional structure regardless of the details of folding. 

The exact contribution of the variable regions to the antigen combining site 
must also await analysis of the three-dimensional structure. It is known that the 
Fab fragment contains both VL and VH regions, and affinity-labeling
experiments 24 indicate that tyrosyl residues 34 - 35 in these regions are directly involved
in antigen binding. The constancy of the disulfide bonds in VH and VL regions
and their coordinate location suggest the possibility that the site is fixed by these
bonds and that variations in the branches of a chain connected by each bond
may be sterically arranged around the bond as a center. The closely homologous
CL and CH1 regions may serve additionally to stabilize the structure in the face of
the variation, so that both VL and VH can participate in the site.

In contrast to the diversity of V regions, the origin of which is still unknown, 22 
the C regions appear to be quite stable in various animal species. Recent 
studies 8 show striking resemblances in the Fc portions of rabbit 23 and human 
γG immunoglobulin. In this respect, C regions, like enzymes, may have evolved
to interact with specific molecules, e.g., those of the complement system. The 
presence of genetic differences in C regions has not so far been related to their 
function, but there is no reason to expect that the origin of variations in C regions 
will differ from other genetic polymorphisms. 

We are deeply indebted to Dr. Jack Brook for supplying Eu plasma. We are grateful
to Miss Joan Low, Miss Catherine Volin, and Mrs. Helvi Hjelt for their expert technical
assistance.